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ABSTRACT 

The scaling of observable properties of galaxy clusters with mass evolves with time. As¬ 
sessing the role of the evolution is crucial to study the formation and evolution of massive 
halos and to avoid biases in the calibration. We present a general method to infer the mass 
and the redshift dependence, and the time-evolving intrinsic scatter of the mass-observable 
relations. The procedure self-calibrates the redshift dependent completeness function of the 
sample. The intrinsic scatter in the mass estimates used to calibrate the relation is consid¬ 
ered too. We apply the method to the scaling of mass Ma versus line of sight galaxy ve¬ 
locity dispersion a^, optical richness, X-ray luminosity. Lx, and Sunyaev-Zel’dovich signal. 
Masses were calibrated with weak lensing measurements. The measured relations are in good 
agreement with time and mass dependencies predicted in the self-similar scenario of struc¬ 
ture formation. The lone exception is the Lx-Ma relation whose time evolution is negative 
in agreement with formation scenarios with additional radiative cooling and uniform preheat¬ 
ing at high redshift. The intrinsic scatter in the ctv-Ma relation is notably small, of order 
of 14 per cent. Robust predictions on the observed properties of the galaxy clusters in the 
CLASH sample are provided as cases of study. Catalogs and scripts are publicly available at 
http://pico.bo.astro.it/-sereno/CoMaLit/ 
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1 INTRODUCTION 

Scaling relations among cluster global properties embody impor¬ 
tant clues on the formation and evolution of cosmic structures. They 
result from the main gravitational processes driving the cluster evo¬ 
lution jKaiser|1986[|Giodini et al.|2013[[Batta g lia et al.|2012[[Ettori| 
|2013| l. Accurate mass-observahle relations are also needed to use 
the abundance of galaxy clusters to constrain cosmological param¬ 
eters jVikhlinin et al.|200^|Mantz et al.|2010b||2014l|Rozo et al.| 
|2010[[Hanck Collaboration et al.|2014b| >. 

This paper is the fourth in a series titled ‘CoMaLit’ (Com¬ 
paring MAsses in LITerature), which aims to assess our present 
capability to measure cluster masses, and to develop methods to 
measure scaling relations through Bayesian techniques. In the first 
paper ( [Sereno & Ettori|2014| CoMaLit-I), we evaluated systematic 
differences in lensing and X-ray masses obtained from independent 
analyses and we quantified the overall level of bias and intrinsic 
scatter of these mass proxies. The second paper presented the for¬ 
malism to calibrate an observable cluster property against the clus¬ 
ter mass and applied the methodology to the Sunyaev-Zel’dovich 
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(SZ) Planck selected clusters ( [Sereno, Ettori & Moscardini|2014| 
CoMaLit-II). The Literature Catalogs of weak Lensing Clusters 
(LC^), which are standardised and homogenised compilations of 
clusters and groups with weak lensing (WL) mass estimates, were 
presented in the third paper jSereno|2014| CoMaLit-III). Here, we 
extend the Bayesian approach to account for redshift evolution of 
the scaling relations, of the intrinsic scatters in the mass and in the 
observable, and of the selection function. 

If gravity is the dominant process, the resulting self-similar 
model predicts scaling relations in form of scale-free power laws 
( |Kaiser|1986 [ Giodini et al.|2013t . Numerical simulations ( jStanekj 
jet al.|201^|Fahjan et al.|20fT| ( confirmed these scalings and showed 
that intrinsic scatters around the median relations approximately 
follow a log-normal distrihution. This basic theoretical scheme 
is very successful in describing observed scaling relations in X- 
ray and SZ ( [Ettori |[MT3| [2015^ . Deviations from the self-similar 
scheme may indicate that non-gravitational processes, such as feed- 
hack and non-thermal processes, contribute significantly to the 
global energy budget in clusters ([Maughan et al.|2012|l. 
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The precise measurement of the redshift evolution of the scal¬ 
ing relations is then crucial to understand how either gravitational 
or non-gravitational phenomena drive the formation and evolu- 
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tion of clusters. Furthermore, if the time evolution is neglected or 
wrongly shaped, the estimated scaling with mass may he biased 
in cluster samples spanning a significant redshift range ( [Andreon] 
|20T4l l. 

Any real time-dependence in the scaling of observables with 
mass and redshift has to be separated by other effects connected 
either to the evolution of the cluster mass function or to the redshift 
dependence of the selection function and of the completeness of the 
sample. Further complications are due to the fact that usual samples 
of clusters are not assembled according to well defined criteria but 
they may be just heterogeneous collections of systems with high 
quality data. In this case, the determination of the selection function 
is very problematic. 

Here we develop a method that measures at the same time 
the evolution of the scaling relation and the completeness/selection 
function of the sample. This is a self-calibrating method which is 
intended to be optimised in large optical survey such as Euclid 
( [Laureijs et al.|20TT| l. If we calibrate a cluster observable against 
the cluster mass, this relation can be used to construct a mass proxy 
based on the observable. The optimal mass proxy is expected to 
be easy to measure, unbiased, and minimally scattered. A crucial 
aspect is that in the first step we cannot calibrate the observable 
against the true mass (which cannot be measured), but we have to 
rely on another mass proxy, such as the WL mass or the hydro¬ 
static mass, which are scattered too ( |Rasi a et al.||2012 CoMaLi t- 
i|. This scatter has to be considered to avoid biases (|Mantz et al. 
201 4|CoMaGFTl |CoMaLiTTlt 

We apply the method to calibrate mass proxies based on the 
line of sight velocity dispersion of cluster galaxies, which is sup¬ 
posedly the best mass proxy jStanek et al.|2010{|Saro et al.|2013| ), 
and three other observables, which are more scattered but optimised 
for large surveys, i.e., the optical richness, the X-ray luminosity, 
and the Sunyaev-Zel’dovich integrated Compton parameter. 

The paper is organised as follows. Section|^is devoted to gen¬ 
eral considerations on the redshift evolution of the scaling relations, 
of the intrinsic scatters, and of the selection/completeness function. 
Section reviews the methodology employed to perform the re¬ 
gression and to recover at the same time the scaling relations and 
the completeness and selection functions. The cluster catalogs used 
in the analysis are introduced in Sec. Results are presented in 
Sec. [^whereas Sec.|^is devoted to the comparison with theoretical 
predictions and previous works. Final considerations are contained 
in the Sec. In App. we discuss how the masses of clusters 
in selected samples are usually distributed. Appendix describe 
the format of the compiled catalogs of line-of-sight velocity disper¬ 
sions. 

Throughout the CoMaLit series of papers, we have been 
adopting the following conventions and notations. The frame¬ 
work cosmological model is the concordance flat ACDM uni¬ 
verse with density parameter Hm = 0.3, and Hubble constant 
Hq = 70 kms“^Mpc“^. H{z) is the redshift dependent Hub¬ 
ble parameter and Ez = H{z)/Ho- When Ho is not specified, h is 
the Hubble constant in units of 100 km s“^Mpc“^. 

Oa denotes a global property of the cluster measured within 
the radius which encloses a mean over-density of A times the crit¬ 
ical density at the cluster redshift, pcr = 3H{zf/{8nG). Tog’ is 
the logarithm to base 10 and Tn’ is the natural logarithm. 


2 REDSHIFT EVOLUTION 

Scaling relations evolve with redshift. Numerical simulations 
( [Stanek et al.|2010) and theoretical predictions ( [Giodini et al.|2013| > 
agree that the relation between the mass Ma and any observable 
quantity O can be summarised by the form 

O oc MiE2. (1) 

Within this framework, the redshift evolution in the median scaling 
relation is accounted for by the factor E2 whereas the slope /? is 
redshift independent. In fact, in the self-similar scenario, the evolu¬ 
tion does not depend on the mass scale and only the normalisation 
depends on cosmic time. 

We tested this scheme in a number of cases. We considered ob¬ 
servables connected either to the galaxy distribution, i.e., velocity 
dispersion of galaxies along the line of sight, Cv, or optical rich¬ 
ness, A, or to the intra-cluster medium, i.e., the bolometric X-ray 
luminosity. Lx, or the spherically integrated SZ Compton signal, 
Ysz- In the self-similar scenario, we expect that for clusters in equi¬ 
librium the scalings go like jGiodini et al.|2013]|Ettori|2015^ 
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where Da is the angular diameter distance to the cluster. The above 
self-similar scaling relations evolve with redshift as E]^'^, with 
7sa = 1/3, 0, 7/3, or 2/3 for the galaxy velocity dispersion, the 
optical richness, the X-ray luminosity, or the spherical SZ signal, 
respectively. The scaling of the X-ray luminosity depends on the 
energy band. For the soft X-ray luminosity in the rest-frame energy 
band [0.1 — 2.4] keV, 1^® evolution can be expressed as 

( |Ettori|2015| l, 

oc ElMA- ( 6 ) 

Together with the redshift dependence of the median relation, 
the intrinsic scatter of the relation and the scatter between the true 
mass and the mass proxy used to calibrate the relation may evolve 
as well. Furthermore, any apparent redshift evolution of the scal¬ 
ing may be degenerate with the evolution of either the mass or the 
selection function. We discuss these aspects in the following. 

2.1 Intrinsic scatter 

Broadly speaking, the intrinsic scatter of a scaling relation is re¬ 
lated to the degree of regularity of the clusters. The larger the devi¬ 
ations from dynamical/hydrostatic equilibrium the larger the scatter 
( [Fabjan et al.|20lT||Saro et al.|2013) ). Scatter is then prominent in 
morphologically complex halos. Triaxiality is another major source 
of scatter, since clusters are usually studied under the simplifying 
assumption of spherical symmetry ([Limousin et al.||2013| [Sereno] 
[et al.[2013l l. Since high redshift clusters are more irregular and less 
spherical, the scatter is usually expected to increase with redshift. 

Let us consider the evolution of scatter in a number of cases. 
Based on numerical simulations, [Saro et al.H2013l ( showed that the 
scatter of dynamical mass estimates based on the line-of-sight ve¬ 
locity dispersion is approximately log-normal and that it increases 
with redshift as 

f’'iog(M„^/Mvir) — 0.13(1 + 0.25z). (7) 

They argued that the dominant contributor to the scatter is the in¬ 
trinsic triaxial structure of halos and that its evolution with redshift 
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is also the dominant source of the increasing scatter of the ID dy¬ 
namical mass estimates with redshift. 

|Fabjan et al.| ( | 20 lT| l studied the scaling relations between the 
cluster mass and some proxies based on X-ray quantities with a set 
of cosmological hydrodynamical simulations. They found that the 
scatter distribution around the best-fitting relations is always close 
to a log-normal one and that the scatter increases with redshift. 

The precise quantitative estimate of the scatter and of its evo¬ 
lution strongly depends on the details of the baryonic physics in¬ 
cluded in the simulations. We considered the results of |Fabjan et al.| 
( | 201 1 ^ for runs with non-radiative physics and standard viscosity. 
To study the time evolution, we fitted the values of the scatter ob¬ 
tained at different redshifts {z = 0.0, 0.25, 0.50, 0.80, and 1.0) 
under the assumption of self-similar scaling relation jFabjan et al.| 
|201 1[ tables 1, 2, and 3). The mass proxy is based on lx, 
i.e., the product of the gas mass within rsoo and the spectroscopic 
temperature outside the core ^Kravtsov, Vikhlinin & Nagai[2006| >. 
We found that the scatter evolves as 

<^iog(MY^/M^oo) — 0-03i?°'^®, (8) 

or, with an alternative forn{5 

f’'iog(My-^/M 5 oo) — 0-03(1 + 0.14z}. (9) 

For the mass proxy based on the emission-weighted tempera¬ 
ture, we found 

criog(Mr^^/M 5 oo) — 0-06i?“'^®, ( 10 ) 

or 

/Msoo) - 0-06(1 + 0.19«). (11) 

The above results show that the scatter mildly increases with 
redshift. This suggest that the evolution of the scatter can be mod¬ 
elled as 

CToIj. = (12) 

2.2 Completeness 

The completeness of a sample usually evolves with redshift. Very 
massive clusters are rare and difficult to be found in the local vol¬ 
ume but they are still forming at high redshift. On the other hand, 
only clusters emitting very strong signals can be detected to very 
large distances. As detailed in App. the selection and the mass 
functions conjure to make the distribution of true masses in ob¬ 
served samples fairly unimodal. Tbe evolution of the completeness 
of the sample can be characterised through the evolution of the peak 
and of the dispersion of this distribution. 

The mean (logarithmic) mass of the sample is connected to 
the observational threshold (see App. S’ which may evolve with 
redshift, and to the scatter between the mass and the observable 
quantity used to select the clusters, which evolves too. 

Let us first consider the evolution of the mass corresponding 
to a completeness limit. As a first example let us consider a flux- 
selected sample. The luminosity scales with mass as 

La oc M^E2. (13) 

^ The function (1 + , i.e., ~ (1 + 712 ) for 2 ^ 1 , can approximate 

Ez in small redshift intervals. The coefficient 71 used in the approximation 
depends on the redshift range considered and on the cosmological parame¬ 
ters. 


If we select only clusters above a limiting flux, /th, the correspond¬ 
ing luminosity evolves as Lth{z) oc , where Di^{z) is 

the luminosity distance. In absence of scatter, the corresponding 
limiting mass evolves as 

Mth oc Dl ( 2 ) ? i?z ^. (14) 

As a second example, let us consider a SZ-like signal, whose 
size increases with the projected physical surface covered by the 
cluster. In this case, the observable is proportional to Da{z)^0\, 
where 6 'a is the angular extension of the cluster. The scaling can be 
written as 

DAizfYA oc M^E:. (15) 

The noise is proportional to the square root of the angular area, i.e., 
(Tya oc 0a = ta/Da- In absence of scatter, if we select only 
clusters above a given signal-to-noise ratio (SNR), i.e., Ya/o-y^ > 
SNRth, the corresponding threshold mass evolves with redshift as 

Mtb o: Da{z)^E 2-^r (16) 

The two above examples suggest that the evolution of the mass 
at a given completeness limit can be parametrized as 

Mth ocDa(2)^°£;2®% (17) 

where the factor E2^^ accounts for the evolution in both the mass 
threshold and the intrinsic scatter of the scaling relation. This mod¬ 
elling of the completeness limit was derived for samples selected 
with a cut on the detection observable but the functional form 
is flexible enough to address even more complicated cases. The 
choice of the angular diameter distance over the luminosity dis¬ 
tance in Eq. 0 is irrelevant since the two differs for a factor 
( 1 - 1 - 2 )^ which can be approximately englobed in for limited 
redshift baselines. 

The evolution in the dispersion of the mass sample is mainly 
connected to the intrinsic scatter in the relation used to select the 
samples, see App. The redshift dependence can then be mod¬ 
elled as in Eq. Gl- 


3 REGRESSION SCHEME 

When we calibrate a scaling relation we deal with: i) the true mass 
of the cluster Ma, which we cannot measure; ii) a scattered (and 
likely biased) proxy of the true mass, such as the weak lensing mass 
Mwl.a, which is the proxy we considered in following, or the hy¬ 
drostatic mass Mhe.a (see § 2 and app. A of |CoMaLit-I^ ; in) an 
observable quantity O, which we assume to be on average related 
to the true mass with a power-law. 

In logarithmic variables, the median scaling relation is approx- 
imatively linear and the scatter is Gaussian. As discussed in Sec.|^ 
the scaling can be expressed as 

log{E~'^=‘‘0) = a /SlogMA + 7 z logi^z- (18) 

Since we englobed the self-similar evolution in the left-hand of 
Eq.@, values of the parameters 72 which are different from zero 
denote deviations from the self-similar time dependence. In other 
words, the time evolution of the scaling relations 7 ^ is relative to 
that predicted by the self-similar model. Given a particular scaling 
law, there is negative, i.e., 7 z < 0 (positive, i.e., 72 > 0 ) evolution 
if the normalisation at high redshift is lower (higher) than antici¬ 
pated from the self-similar scaling. 

In what follows, which is the general scheme we employed for 
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Table 1. Parameters of the regression scheme and their description. The 
variables Z, X, and Y denote (the logarithm of) the true mass, the WL 
mass, and the self-similarly evolved observable, respectively. 


Type 

Meaning 

Symbol 

Yz = o!y\z + Py\zZ + Jz log Ez 


Conditional scaling relation 

Intercept 

^YIZ 


Mass evolution 

/3v|z 


Time evolution 

Iz 

— (^YlZ,0^z 



Conditional intrinsic scatter 

Conditional scatter at 2 = 0 

'7y|z.o 


Time evolution 

'^^Y\Z 

^Xjzi^J — ^XIZ,0-^z 



Intrinsic scatter of the WL mass 

Conditional scatter at 2 = 0 

'^X|Z,0 


Time evolution 

'^^X\Z 

Mz(z) = Mz + ly.z logEz + 7p 

IZ,D log Da{z) 


Mean of the mass function 

Normalisation 

Mz 


Time evolution with Ez 

ly-z 


Time evolution with Da 


(72 (z) = 



Dispersion of the mass function 

Dispersion at 2 = 0 

(^z,0 


Time evolution 

7<2z 


the regression analysis, we identify logMA with the variable Z, 
we identify the logarithm of the mass proxy, i.e., the weak lensing 
mass, with the random variable X, and we identify the logarithm 
of the self-similarly redshift evolved observable with the response 

Y. In this scheme, the mass is the covariate variable, as when using 
number counts of galaxy clusters to constrain cosmological param¬ 
eters. The observed values are denoted with the lower case, i.e., x 
and y are the manifest measured estimates of the latent X and Y, 
respectively jPeigelson & Babu|2012| ). This notation is the conven¬ 
tion adopted in the CoMaLit series. 

The conditional probability of X given Z is 

P{X\Z)^X-{Z,axiz{z)), (19) 

where J\f is the normal distribution. In Eq. l |I9| >, X is an unbiased 
proxy of Z. Any bias between X and Z would be degenerate with 
the estimated overall normalisation of the scaling between Y and 

Z. This bias cannot be determined with the data, which only con¬ 
strain the relative bias between X and Y (see |CoMaLit-I[ l. As dis¬ 
cussed in Section [TT] the redshift evolution of the scatter is mod¬ 
elled as 

^x\ziz) = crx\z,oEz ■ ( 20 ) 

The mean observable for a given mass is linearly related to the 
(logarithm of the) mass and the relation evolves with redshift, 

Yz = aY\z + PyizZ + Yz^ogEz', (21) 

the redshift z is deterministic and assumed to be known without 
measurement errors. Y is scattered and distributed according to the 
conditional probability 

P{Y\Z)=M{Yz,aY\ziz)), (22) 

with 

^Yiziz) = aY\z,oEz (23) 

The distribution of the masses can be approximated with a Gaussian 
function 

P{Z) = M{yz{z),az{z)). (24) 


The mass distribution resulting from usual selection procedures is 
fairly unimodal (see App. and can be approximated with the 
normal distribution of Eq. l|24^. The statistical improvement ob¬ 
tained considering more complex distributions, such as mixture of 
Gaussians with different means and variances, is usually negligible 
( |Kelly|2007|CoMaLit-II^ . 

As discussed in Sec. |2.2[ the evolution of the (mean of the) 
mass function can be modelled after Eq. l |17[ ( as 

yz[z) = fiz +7^2 log-Bz +7/.*z.D logB>A(2:). (25) 

The dispersion evolves as 

az{z) = azfiE]”^ . (26) 


The completeness function at a given redshift can be computed 
by dividing the estimated mass function (Eq.|24^ by the cosmolog¬ 
ical halo mass function. This approach requires the knowledge of 
the effective survey area of the sample, which may be difficult to 
estimate for heterogeneous samples. Alternatively, we can use the 
approximate formulae presented in App.fA] which were derived un¬ 
der the assumptions that the completeness function can be approxi¬ 
mated as a complementary error function and that the cosmological 
halo mass function can be approximated as a power-law. 

If we assume that the uncertainty in the measurement process 
is Gaussian, the relation between the unknown Xi and Yi and the 
measured Xi and yi is given by 

= Ar™({Ai,yi},VOW(t/th,i,oo), (27) 


where and U are the bivariate Gaussian and the uniform dis¬ 
tribution, respectively. In Eq. l |27| l, Vi is the symmetric uncertainty 
covariance matrix whose diagonal elements are denoted as <5^ i and 
5y_i, and whose off-diagonal elements are denoted as Pxy5x,iSy,i. 

The truncation, i.e., null probability for yi < yth,i, accounts 
for selection effects when only clusters above an observational 
limit (in the response variable) are included in the sample, i.e., the 
Malmquist bias ^CoMaLit-II[ ). 

The treatment is complete once the priors on the parameters 
are made explicit. We choose non-informative priors as discussed 
and ICoMaLit-II The priors on the intercept aY\z 


CoMaLit-I 


and on the mean pz are taken to be flat. 


oly\z, M 2 ~ W(—1/e, 1/e), (28) 

where e is a small number. In our calculation we took e = 10“^. 
A priori, the slopes follow the Student’s ti distribution with one 
degree of freedom, as suitable for uniformly distributed direction 
angles. 


I3y\Z: 72) 7)2x12) T)2 x|2’ 7m2) 7m2,.c>) 7)^2 ^ (29) 

The Student prior for the slopes is not informative. Negative time 
evolutions and scatters which decreases at early times are allowed. 
For the variances, we adopted an inverse Gamma distribution, 

l/crxiz.O) l/o'v|z,0) l/crl.o ~ r(e, e). (30) 

This regression scheme requires 12 parameters, i.e., three pa¬ 
rameters characterising the scaling relation, two for the intrinsic 
scatter, two for the mass scatter, and five for the mass function, plus 
three variables for each cluster, i.e., the true weak lensing mass, the 
true mass, and the true observable. The parameters and their mean¬ 
ings are summarised in Table[^ 

The relation in Eq. expresses the conditional scaling re¬ 
lation, wherein Yz is the most likely value of the variable Y for 
a given Z. This is the relation to be used to predict the value of 
y for a given Z. The relation between two random and scattered 
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variables might be better described by the symmetric scaling rela¬ 
tion, which goes along the direction where the probabilities of Z 
and Y are maximised at the same time ( |CoMaLit-II^ . In the above 
regression scheme, where Z and Y follow a bivariate normal dis¬ 
tribution, the slope of the symmetric scaling can be expressed as 
( |CoMaLit-II| > 


(jy 'J'^z + 2(2py2 1 ^ 

PY-Z = --, (Jl) 

ZpYZ Crz(Ty 

where pYz is the correlation factor between Y and Z, 

Py\z 


pYZ = ■ 


V|Z 


/-|-/3^|Z 


(32) 


and the variance in Y is related to the conditional scatter as 


Oy — Oyt^z 0Y\Z'^%- (33) 

The intercept of the symmetric relation can be expressed as 


ay-z = cs,y]^z + {Py\z — I3y-z)pz- (34) 

The detailed regression scheme is simplified when we are in¬ 
terested in the scaling between the observable and the measured 
proxy mass, i.e., the WL or the X-ray mass. 


log(i ?2 = a -I- ,3 log Mproxy + 7^ log-Ej. (35) 


In this case, the adopted form for the scaling is 


Yx = ay\x + Py\xX + (36) 

which substitutes Eq. 0- The latent variable Z coincides now 
with the manifest one X and we do not have to model the condi¬ 
tional probahility of X given Z, see Eqs. l|19|l and (|20^. 


4 CLUSTER CATALOGS 

There are different approaches to choose a sample of clusters to 
analyse. We may look for a statistical sample which is complete 
with respect to well-defined selection criteria. This sample would 
be ideal but most of the massive clusters with very good quality 
data might be excised. The alternative is to assemble samples as 
numerous as possible with the idea that variety and largeness can 
compensate for incompleteness and inhomogeneity. 

These two approaches are to some degree complementary and 
have been already discussed in |CoMaLit-II| and |CoMaLit-III[ which 
we refer to for further considerations. Here we are mainly interested 
in testing the regression algorithm and we focus on large samples. 
To this aim we assembled a catalog of clusters with measured ve¬ 
locity dispersions. As catalogs of weak lensing masses, optical rich¬ 
nesses, X-ray luminosities, and SZ effects, we used publicly avail¬ 
able compilations. The subsample of weak lensing clusters also in¬ 
cluded in either the velocity dispersion, richness. X-ray, or SZ cat¬ 
alogs were used in the analysis presented in the next section. We 
briefly discuss the main properties of the catalogs and refer to the 
original references for further details. 


4.1 Weak lensing masses 

|CoMaLit-III| retrieved from literature 822 weak lensing analyses 
of clusters and groups with measured redshift and mass. Here, we 
consider the hC^-single, which contains 485 unique entries with 
reported coordinates, redshift, and WL masses to over-densities of 
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2500, 500, 200, and to the virial radius]^ Duplicate entries from 
input references were carefully handled. 

The cluster redshifts span a large interval, 0.02 < z < 1.5. 
The catalog is large and standardised but it is not statistically com¬ 
plete. We refer to |CoMaLit-III| for a detailed discussion of the cat¬ 
alog properties. 


4.2 Velocity dispersions 

We assembled some publicly available catalogs of clusters with 
measured velocity dispersions. We first review the source catalogs 
and then we introduce the merged compilation. 


4.2.1 Source catalogs 


|Cava et ^L| p009| > presented the results from the spectroscopic 
survey WINGS (Wide-held Nearby Galaxy-cluster Survey)-SPE, 
which consists of 48 nearby clusters at 0.04 < 2 < 0.07 selected 
from three X-ray flux limited samples. They complemented the 
sample with 29 additional clusters not observed in the programme 
but for which literature data existed. The total sample contains 77 
clusters over a broad range of richness, Bautz-Morgan class, and 
X-ray luminosity. 

[Ebeling et al.H2007} presented the sample of the 12 most dis¬ 
tant galaxy clusters detected at 2 > 0.5 by the Massive Cluster 
Survey (MACS). This catalog is statistically complete and compre¬ 
hensive of measurements of radial velocity dispersions. 

IGirardi & Mezzettl] ( |2001[ l considered a sample of 51 distant 
galaxy clusters at 0.15 < 2 < 0.9, each cluster having at least 10 
galaxies with available redshift in the literature. In some clusters, 
two peaks that are not clearly separable were identified in the veloc¬ 
ity distribution. Eor these systems with uncertain internal dynam¬ 
ics, we considered the velocity dispersion measured by analysing 
the identified peaks together. We also discarded two systems with 
no major peak (CL J0023-I-0423 and CL J0949-I-44). 

|Mazure et al.| ^1996) constructed a volume limited sample of 
128 clusters out to 2 = 0.1 combining data from the ENACS (ESO 
Nearby Abell Clusters Survey) with pre-existing data from litera¬ 
ture. They measured reliable velocity dispersions for a subset of 
80 of them, based on at least 10 redshifts. They also analysed 26 
additional clusters in the cone but with 2 > 0.1. The total catalog 
consists of 106 clusters. We discarded from our final catalog the 
secondary systems. 

[Oegerle & Hill|p001[ > presented the spectroscopic study of a 
sample of 25 Abell clusters out to 2 = 0.1 containing a central 
cD galaxy. Redshifts measured with the MX Spectrometer were 
combined with those collected from the literature to obtain typi¬ 
cally 50-150 observed velocities in each cluster. We used the esti¬ 
mates of the velocity dispersions within the smaller quoted aperture 
(~ 1.2 Mpc/h at 2 = 0.1) 

|Popesso et al.| ( [2007l l considered a sample of 137 optically se¬ 
lected and spectroscopically confirmed Abell clusters in the SDSS 
(Sloan Digital Sky Survey) database jAdelman-McCarthy et al.| 


2006 1 . The clusters span the redshift range 0.04 < 2 < 0.17. 40 of 


the clusters were X-ray under-luminous, since they had a marginal 
X-ray detection or remained undetected in the ROS AT All Sky Sur¬ 
vey. 


^ The catalogs are available at http;//pico . bo . astro . it/ 
~sereno/CoMaLit/LC2/ 
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|Rines & Diaferio| P006[ l studied the infall patterns of 72 
nearby {z < 0.1) clusters from the Data Release (DR) 4 of the 
SDSS. The clusters were selected in X-ray flux from the ROSAT 
All-Sky Survey. Velocity dispersions were measured and masses 
were derived with the caustic method. rRines & Diaferio| ( |2010[ > ex¬ 
tended the approach to a sample of 16 groups with lower X-ray 
fluxes selected from the 400 deg^ serendipitous survey of clusters. 
Spectroscopic data were taken from the SDSS DR5. |Rines et al.| 
( |2013^ selected 58 clusters by their X-ray flux and in the redshift 
interval 0.1 < 2 < 0.3 to build the Hectospec Cluster Survey 
(HeCS), the first systematic spectroscopic survey of cluster infall 
regions at 2 > 0.1. For each cluster, high signal-to-noise spectra 
for ~200 cluster members were acquired with MMT (Multi Mirror 
Telescope )/Hectospec. 

|Ruel et aL| ( |2014| l presented optical spectroscopy of galax¬ 
ies in clusters detected through the SZ effect with the South Pole 
Telescope (SPT). They reported measurements of 61 spectroscopic 
cluster redshifts, and 48 velocity dispersions each calculated with 
more than 15 member galaxies. After the inclusion of additional 
measurements of SPT-observed clusters previously reported in the 
literature, the final catalog presents 57 velocity dispersions. Being 
SZ selected, most of the clusters are at high redshift. The clusters 
span an interval 0.3 "S z 1.5. 

|Sif6n et ^^2013^ presented the dynamical analysis of a sam¬ 
ple of 16 SZ selected massive clusters detected with the Atacama 
Cosmology Telescope (ACT) over a 455 deg^ area of the southern 
sky. 60 member galaxies on average per cluster were observed with 
deep multi-object spectroscopic observations. The sample spans the 
redshift range 0.3 < 2 < 1.1 with a median redshift 2 = 0.50. 

|Zhang et al.| ( |20ir| ) presented a multi-wavelength analysis of 
62 galaxy clusters in the HIFLUGCS (Highest X-ray FLUx Galaxy 
Cluster Sample), an X-ray flux-limited sample. Velocity disper¬ 
sions were computed thanks to 13439 cluster member galaxies with 
redshifts collected from literature. Most of the clusters (60 out of 
62) are at 2 < 0.1. 


4.2.2 Merged catalog 


The catalogs listed before provides a total of 710 velocity disper¬ 
sion estimates, comprehensive of multiple peaks and substructures 
which we did not consider in our final sample. 

Cluster coordinates were taken from the original or from com¬ 
panion papers. When they were not reported, we used the coordi¬ 
nates listed by the NASA/IPAC Extragalactic Database (NED)[^ 
Not unique entries were identified by matching names and 
cluster coordinates. For clusters with multiple analysis, we pre¬ 
ferred the study based on the larger number of identified clus¬ 
ter member galaxies with measured redshift, Wmembers. The fi¬ 
nal catalog contains 564 unique clusters. The catalogs are pub¬ 
licly available at http://pico.bo.astro.it/~sereno/ 
CoMaLit / sigma/ Their format is detailed in App.[B] 

When original estimates were provided with asymmetric er¬ 
rors, we computed the mean value as suggested in |D’Agostmi 
1 2004^ . To standardise the uncertainties, we followed |Ruel et ~ 
1 2014^ . They found that the uncertainty in the velocity dispersion 
(jy is well described by 


5 .,, = 


0.92crv 




-1 


(37) 


when including the effect of membership selection. 

As we inferred from the matching with the LC^-single, WL 
masses are known for a subsample of 97 clusters. This size can be 
achieved only relying on a number of different source catalogs. 30 
clusters are from |Girardi & Mezzetti| ( [2001| l, 23 from Rines et ^ 
( |2013^ , 13 from|Zhang et^ (|201 1|>, 11 from|Ebeling et ST ( |2007| >, 


8 from Ruel et akfi 2014^ , 5 fr om |Pop esso et al~|^ 2007 1 , 4 from 
[Mazure et al.| ( |199^ , and 3 from |Sif6n et al.| ( |2013l . 


4.3 Optical richness 

[Rykoff et al.| j2014[ l applied the redMaPPer (red-sequence 
Matched-filter Probabilistic Percolation), a red-sequence cluster 
finder designed for large photometric surveys, to ~ 10000 deg^ 
of SDSS DR8 data. The resulting catalog contains ~ 25000 can¬ 
didate clusters over the redshift range 0.08 < 2 < 0.55. 

According to the catalog convention, the richness A of a clus¬ 
ter is defined as the sum of the probabilities of the galaxies found 
near a cluster to be actually cluster members. The sum extends over 
all galaxies above a cut-off luminosity (0.2L*) and below a radial 
cut which scales with richness. Clusters are included in the catalog 
if their richness exceeds 20 times the scale factor 5'rm (also pro¬ 
vided in the catalog), which is a function of the photometric redshift 
of the cluster. This selection criterion approximately requires that 
every cluster has at least 20 galaxy counts above the flux limit of the 
survey or the cut-off luminosity at the cluster redshift, whichever is 
higher. 


4.4 X-ray clusters 

[Maughan et al.| ( |2008| l presented a sample of 114 clusters cov¬ 
ering wide temperature (2 < ksT < 16 keV) and redshift 
(0.1 ^ z < 1.3) baselines. The sample was assembled from all 
publicly available Chandra data as of 2006 November. It consists 
of clusters at redshift greater than 0.1 listed in the NED which were 
the targets of observations made with the ACIS-I detector covering 
at least half of the area in the annulus [0.9-1.0] rsoo. The radius 
rsoo was estimated assuming a Msoo-Tx relation. 

The sample was later reanalysed using updated softwares and 
calibration files in |Maughan et al.| ( |2012| l. Bolometric luminosities 
were measured either in the [0-1] rsoo aperture, which we took as 
reference case to ease the comparison with theoretical predictions, 
or in the core-excised [0.15-1] rsoo aperture, Lx.ce. This large cat¬ 
alog is standardised in the measurement procedures but it is not 
statistically complete. 

As an alternative we also looked at catalogs of X-ray lumi¬ 
nosities measured in the [0.1-2.4] keV band, We consid¬ 

ered the MCXC (Meta-Catalogue of X-ray detected Clusters of 
galaxies, [Piffaretti et al.|[2011) , which comprises 1743 unique X- 
ray clusters collected from available ROSAT All Sky Survey-based 
and serendipitous cluster catalogues. X-ray luminosities were sys¬ 
tematically homogenised and standardised to an over-density of 
A = 500. Uncertainties are not provided in the catalog. For our 
tests, we fixed the statistical uncertainty to 10 per cent. As the LC^, 
the MCXC is not statistically complete. 


^ http://ned.ipac.caltech.edu/ 


^ We used the latest version of the catalog (v5.2), which is publicly avail¬ 
able at http ://risa.Stan ford, edu/redMaPPer/ 
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4.5 Planck SZ catalog 

The Planck SZ Catalogue (PSZ, [Planck Collaboration et al.|2014a^ 
contains 883 candidates identified with the Matched Multi-filter 
method MMF3 with detections above SNR = 4.5. The catalog spans 
a broad mass range from 0.1 to 16 x IO^^Mq at a median redshift 
of 2 ~ 0.22. The redshift determination is available for 664 candi¬ 
dates. 

In |CoMaLit-II| we computed the spherically integrated Y 500 
of the PSZ clusters within the weak-lensing determined rsoo. The 
measurements of Mwl.soo and Tsoo are then correlated. In our 
analysis, we used the full uncertainty covariance matrix. We refer 
to lCoMaLit-IIl for a detailed discussion. 


5 RESULTS 

We analysed the scaling between mass and optical, X-ray, or SZ 
observables using the general regression scheme detailed in Sec- 
tionj^ In fact, the uncertainties on the redshifts were assumed to be 
negligible and the factors Ez were assumed to be known without 
errors. The Bayesian hierarchical model was implemented through 

jagsE] 

According to the notation of Section]^ the X variable is the 
logarithm (to base 10 ) of the observed weak lensing mass (com¬ 
puted at an over-density of either 200 in case of scaling of optical 
observables or 500 for observables related to the gas); the Z vari¬ 
able is the logarithm of the unknown true mass; the observable Y 
is the logarithm of either the galaxy velocity dispersion, the optical 
richness, the X-ray luminosity, or the spherical SZ signal multiplied 
by ■ Any deviation of the parameter 7 ^ from the null value 

implies a deviation from the self-similar evolution with redshift. 

In the case of optical richness and SZ signal, we had to con¬ 
sider the correction for the Malmquist bias. The threshold value of 
the optical richness above which candidate clusters are included in 
the redMaPPer catalog was given by 20 times the scale factor at the 
cluster redshift jRykoff et al.|2014| >. According to the notation of 
Section |A3] the threshold for the i-th cluster is 

j/th,i = log(205RM,i). (38) 

The limiting SZ flux of the Planck clusters was obtained by 
multiplying the minimum SNR(= 4.5) by the uncertainty on Y 500 
( |CoMaLit-II[ l. In this case, 

j/th.i = log(4.5T;"i'^®Di(2i)5Y5oo,i)- (39) 

The results of the regression are summarised in Table [^ for 
the scaling relations and in Tablej^for the mass functions. Table [^ 
summarises the results of the scaling of the observables versus the 
WL mass. Parameter degeneracies are illustrated as bi-dimensional 
contour plots in Figs. □Ha an dj^ Figuresj^j^jT] and [^show the 
scaling relation and the evolution of the completeness function for 
frv-M 2 oo, A-M 200 , Tx-Msoo, and Fsz-Msoo, respectively. 

To ease the comparison with theoretical predictions, we also 
computed the parameters of the the symmetric scaling relation (see 
Table j^. We did not require that Py-z is redshift independent. 
However, the slopes turned out to be constant within the errors. 

® JAGS (Just Another Gibbs Sampler) is a program for analysis of 
Bayesian hierarchical models using Markov Chain Monte Carlo simulation. 
It is publicly available at http: / / mcmc- jags . source forge . net/ 
An example of JAGS script used for the analysis can be found at http: 

//pico.bo.astro.it/-sereno/CoMaLit/JAGS/. 


Slopes and intercepts of the symmetric relations in Tables [^ and [^ 
were computed at the median redshifts of the samples. 

We obtained significant constraints on the evolution with red¬ 
shift of the scaling relations and of the mass functions. On the other 
hand, the uncertainties on the evolution of the intrinsic scatters are 
too large to come to any conclusion. 


5.1 Parameter degeneracy 

Most of the regression parameters are uncorrelated (see Figs. [T]- 
1^. Since a significant percentage of the massive clusters is at high 
redshift, the time evolution can partially mimic the effects of the 
mass evolution (see the /3y|z-7z panel). This degeneracy is most 
pronounced for the Planck selected clusters, see Fig.j^ whose mass 
completeness limits steadily increases with redshift, see Fig. To 
obtain unbiased estimates of the scaling relations is then cmcial to 
properly account for time-evolution and selection effects. 

The estimate of the normalisation of the scaling relation, 
ay|z, is correlated with the slopes, I3y\z and 7 z. Slope and nor¬ 
malisation are correlated to the intrinsic scatter too. The Pearson 
correlation coefficient between aY\z Wy\z) and (JY\zfi is 0.38 
(-0.38) for the (Tv-Mwl sample, 0.72 (-0.75) for the A-Mwl sam¬ 
ple, 0.50 (-0.53) for the Lx-Mwl sample, and -0.08 (-0.31) for the 
Ysz-Mwh sample. 

The parameters characterising the scaling, i.e., /3v|z, 

and "/z, are not degenerate with the mass functions. Furthermore, 
as already noted in |CoMaLit-I| since ax\z and aY\z spread the 
observed relation in orthogonal directions, they are nearly uncorre¬ 
lated too. 

The only remaining significant degeneracy is among the pa¬ 
rameters characterising the normalisation of the mean value of the 
mass function, flz, and its evolution, parameterised in terms of 
'Itiz.o and 7 pz. The evolution parameters of the (mean of the) 
mass function, ■y 11^,0 and 7 ^^, are degenerate too, see Eqs. © 
and ( |25^ . In fact, the redshift dependence in small intervals can be 
modelled either in terms of Ez or in terms of the distance. 


5.2 Scaling with the weak-lensing mass 

Results for the scalings with the WL mass are reported in Table [^ 
In this case, the general regression scheme simplifies as described 
in Sec.[^ Due to the intrinsic scatter of the WL mass with respect to 
the true mass, the conditional relation 0-Mwl,a is systematically 
flatter than the corresponding O-Ma, whereas the intrinsic scat¬ 
ter of the relation is larger ( [CoMaLit^ |CoMaLit-II| l. On the other 
hand, results for the symmetric scaling relations are consistent. 

Parameters of the 0-Mwl,a are recovered with better accu¬ 
racy but they cannot be used straight on in the conditional O-Ma 
to make predictions based on cosmological functions. In absence of 
a full regression procedure modelling the intrinsic scatter between 
true mass and proxy mass, at least some corrections should be used 
HCoMaLit-I^ . 


5.3 Self-similarity 

We found that scalings with mass (Py-z ~ Paa) and redshift 
(72 ~ 0 ) are fully compatible with the self-similar behaviour, see 
Tables [^ and [^ The lone exception is the mass-X-ray luminosity 
relation, whose dependence with mass is slightly steeper than the¬ 
oretical predictions even if compatible within the statistical uncer- 
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Figure 1. Probability distributions of parameters of the scaling relation between velocity dispersion and mass, (Tv-M2oo> and of the mass function. The 
thick (thin) lines include the l-(2-)(7 confidence region in two dimensions, here defined as the region within which the value of the probability is larger than 
exp[—2.3/2] (exp[—6.17/2]) of the maximum. 


tainty (Py-z = 1-78 ± 0.25 vs. Pes = 4/3), and whose evolution 
with redshift is preferentially negative (72 = — 1.8 ± 0 . 8 ). 

We tested that these general features of the Lx-Ma relation 
do not depend on the different ways in which the X-ray luminos¬ 
ity can be measured. Bolometric luminosities estimated in a re¬ 
gion from which the core is excised are supposed to be slightly 
less sensitive to the baryonic physics. Considering the Lx.ce from 
[Maughan et al.|( 2012 | l, we found Py-z = 1-47 ± 0.18, 7 z = 
— 1.39 ± 0.57, and aY\z = 0.09 ± 0.05. As expected, the Lx.ce- 
Ma relation is more tight, with a smaller intrinsic dispersion, and 
it can be determined with a better statistical accuracy. Neverthe¬ 


less, the evolution in redshift is still negative and fully compatible 
with the result obtained for luminosities accounting for the core re¬ 
gions. The dependence with mass is slightly less pronounced but 
still steeper than the self-similar expectation. 

We also evaluated the mass-X-ray luminosity relation in the 
soft band by considering the MCXC catalog, which comprises 193 
clusters with measured WL mass. For the I/x^oft-AfA relation we 
found Py-z = 1-43 ± 0.15, which is steeper than the self-similar 
prediction Pas = 1 , and a negative redshift evolution which devi¬ 
ates from the expected yss = 2 (see e.g. |EttorI| ( |2015| l for a deriva¬ 
tion of the self-similar predictions in the [0.1-2.4] keV band) by 


© 0000 RAS, MNRAS 000, 000-000 





































CoMaLit-IV 9 



Figure 2. Probability distributions of parameters of the scaling relation between optical richness and mass, X-M 200 , and of the mass function. The thick (thin) 
lines include the confidence region in two dimensions, here defined as the region within which the value of the probability is larger than exp[—2.3/2] 

(exp[—6.17/2]) of the maximum. 


= —1.80 ± 0.59. As expected for the MCXC catalog, whose 
luminosities were standardised from heterogeneous data sets, the 
intrinsic scatter is over-estimated, aY\z = 0.18 ± 0.07. 

5.4 Intrinsic scatters 

The intrinsic scatter of the weak leasing mass with respect to the 
true mass is estimated to be of the order of ~ 25 ± 10 per cent. 
This is slightly larger but still compatible within errors with the 
scatter measured in |Mantz et al.| ( [2014t or in |CoMaLit-H and with 
predictions based on numerical simulations jRasia et al.|2012^. In 


|CoMaLit-II[ we noted that the scatter measured in heterogeneous 
samples, such as hC^-single, may be overestimated due to not co¬ 
herent formulated statistical uncertainties on weak leasing masses. 
In the case of the richness calibration, both estimated weak leas¬ 
ing scatter and related errors doubles, so that the difference in the 
central estimates of the scatter is not statistically significant. 

We confirm that mass proxies based on velocity dispersions 
are among the most accurate (scatter of ~14 per cent). We verified 
that mass proxies based on X-ray luminosity are noisier (scatter of 
~30 per cent) than those based on the integrated SZ effect (~25 
per cent), in agreement with numerical simulations ([Stanek et al.| 
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Figure 3. Probability distributions of parameters of the scaling relation between X-ray luminosity and mass, Ly^-M^oo, and of the mass function. The thick 
(thin) lines include the 1-(2-)(t confidence region in two dimensions, here defined as the region within which the value of the probability is larger than 
exp[—2.3/2)] (exp[—6.17/2]) of the maximum. 


|2010[ l. The richness, with an intrinsic scatter of ~40 per cent, may 
compete with the X-ray luminosity. The scatter in proxies based on 
the X-ray luminosity strongly depend on the measurement/analysis 
process. Luminosities estimated in core-excised regions are less 
scattered (~20 per cent), whereas inconsistencies in the measure¬ 
ment process and not uniform data-sets can boost the scatter up to 
~ 40 per cent. 

The catalog of velocity dispersion used to study the (TV-M 200 
relation is heterogeneous which might bias the relation and inflate 
the estimated intrinsic scatter. Firstly, the central estimates of the 
velocity dispersion were estimated in the different source papers 


with different methods. However, these statistical differences are 
very small. [^el et al.| ( |2014^ presented two independent measure¬ 
ments of (Jv, based either on the measured gapper scale or the bi¬ 
weight dispersion. The two estimates are very well consistent, with 
a distribution of relative differences whose centre deviates from 
zero by less than one per cent and whose scatter is less than 2 per 
cent. 

Secondly, systematic differences in the measurements by in¬ 
dependent groups might play a role. We checked that results based 
on the merged catalogs are fully consistent with those from sin¬ 
gle source catalogs with homogeneous estimates of (jy. 31 clusters 
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Figure 4. Probability distributions of parameters of the scaling relation Ygz-^SOO, and of the mass function. The thick (thin) lines include the 1-(2-)(t 
confidence region in two dimensions, here defined as the region within which the value of the probability is larger than exp[—2.3/2] (exp[—6.17/2]) of the 
maximum. 


from |Girardi & Mezzetti|j^01^ have measured WL mass. For these 
clusters, we found aY\z = 2.74 ± 0.33, /3y\z = 0.22 ± 0.34, 
7 z = 0.04 ± 0.54, and (Ty\z,o = 0.07 ± 0.03. For the 26 clus¬ 
ters from |Rines et al.| ( |2013| l with measured WL mass, we found 
aY\z = 2.72 ± 0.27, i3y\z = 0.26 ± 0.29, 7 ^ = -0.56 ± 1.23, 
and <Jy\z,o = 0.07 ± 0.02. 

5.5 Completeness function 

The evolution of the completeness function is obtained through the 
analysis of the redshift dependence of the distribution of selected 


true masses. In Figs.|^|^|7] and we plotted the 15, 50, and 85 
per cent completeness limit as a function of the redshift. The com¬ 
pleteness was approximated as a complementary error function, see 
App.[Al whose scale and dispersion were derived from the param¬ 
eters of the fitted mass function through Eqs. jA9| ) and jAlQ| l. 

For the samples of clusters with weak lensing mass and veloc¬ 
ity dispersion, SZ flux, or X-ray luminosity, the larger the redshift 
the larger the mass at a given completeness limit, see Figs.j^l^ and 
[S] This is not the case for the WL clusters in the redMaPPer cata¬ 
log, when the limits are nearly redshift independent, see Fig.|^ The 
apparent spike in some completeness functions at high redshift, see 
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Table 2. Observed scaling relations, Ez Conventions are as in Sectionj^and Table^ Z = log(M^); X = log(MwL,A); 

Y = log(£^^"’'®®(Tv), A), or log{E^'^^^D^Y^oo)- 7 ss is the self-similar evolution, which is equal to 1/3, 0, 7/3, 2/3 for Cv—, A-, 

Lx— and Yqz-^^a^ respectively. Units are 10^‘^Mq for mass, km/s for cry, 10"^^ ergs s“^ for the bolometric luminosity Lx, and 10~'^Mpc^ for D^^Y^qq. 
Cols. 1-3: variables of the regression procedure. Col. 4: number of clusters in the sample (A^ci)- Col. 5: median redshift of the sample. Cols. 6, 7, and 8: 
intercept, mass slope, and time evolution of the conditional scaling relation. Cols. 9-10: local scatter of the WL mass and its time-evolution. Cols. 11-12: 
intrinsic scatter of the scaling relation and its time-evolution. Cols. 13-14: intercept and slope of the symmetric scaling relation at the median redshift of the 
sample. Col. 15: self-similar prediction for the slope i^ss)- 


Conditional scaling WL mass scatter Intrinsic scatter Symmetric scaling 


10^ 

10^ 

10'" 

JVci 

z 

Oi-Y\Z 

?Y\Z 

Iz 

^X\Z,Q 

^^x\z 

^YIZ,0 

iz 

0!Y-Z 

/^Y-Z 

Ab 

A/200 

A/wl,2oo 


97 

0.23 

2.67±0.12 

0.30±0.13 

-0.05 ± 0.30 

0.11 ±0.06 

0.00± 1.26 

0.06 ±0.02 

0.17 ±0.96 

2.63 ±0.14 

0.34±0.15 

1/3 

A/200 

A/wl,2oo 

A 

157 

0.30 

1.29±0.21 

0.70 ±0.27 

0.39 ±0.58 

0.22 ±0.12 

-0.56 ± 1.22 

0.18±0.08 

-0.55 ± 1.11 

1.12±0.21 

0.91 ±0.25 

1 

A/500 

A/wl,5oo 


73 

0.38 

-0.13±0.26 

1.60 ±0.27 

-1.74 ±0.75 

0.11 ±0.05 

-0.04 ± 1.01 

0.12±0.07 

0.03 ± 1.16 

-0.29 ± 0.24 

1.78±0.25 

4/3 

A/500 

A/wl,5oo 

E-^I^DlY,oa 

115 

0.23 

-0.23 ±0.04 

1.27 ±0.21 

-0.76 ±0.80 

0.11 ±0.04 

-0.25 ±1.20 

0.11 ±0.05 

-0.37 ± 1.32 

-0.27 ± 0.06 

1.50±0.21 

5/3 


Table 3. Mass functions of the observed samples. Conventions and units are as in Tables^and]^ 




Mean 


Dispersion 

Sample 






(TV-AL200 

1.08±0.17 

0.18±0.16 

0.69 ± 0.86 

0.16 ±0.04 

0.74 ±0.96 

A-M200 

0.79 ±0.21 

0.01 ±0.24 

-0.13 ±0.92 

0.31 ±0.07 

-0.04 ± 0.77 

Tx-Msoo 

1.02 ±0.32 

0.17 ±0.36 

-1.03 ± 1.08 

0.17 ±0.04 

1.30 ±0.77 

Tsz-Msoo 

0.51 ±0.15 

0.57 ±0.15 

0.53 ±0.79 

0.17 ±0.03 

0.03 ± 0.79 


Figs. EE and|^ is not statistically significant and more regular 
evolutions are fully compatible with our results. 

Few high redshift clusters lie above the 85 per cent complete¬ 
ness limit. This is expected since high mass clusters are very rare 
at high redshifts and few of them have measured WL mass. 


Even though the redMaPPer catalog of optical richnesses 
([Rykoff et al.|2014l and the Planck catalogs ( [Planck Collahoration| 
jet al.|201~ I are statistically complete, we do not expect to exactly 
recover the selection function of the full catalogs. In fact, the sub¬ 
samples of clusters with measured weak lensing masses may be 
biased with respect to the full catalogs. 


The derived completeness functions refer to the subsample of 
the cluster in the catalogs with measured WL mass. The catalog of 
WL masses is heterogeneous which makes the studied subsamples 
heterogeneous too, even in the case of parent catalogs constructed 
with well defined selecfion functions. However, despite the fact that 
we do not expect that the completeness function of the subsamples 
with WL masses strictly resembles the completeness of the parent 
catalog, some similarities are still in place. The redshift evolution of 
the derived completeness limits of the SZ flux-WL catalog follows 
that of the Planck clusters, see the upper panel of Fig.[^ The shift 
in normalisation reflects the fact that masses used in IPlanck Collah-I 
[oration et al.H2014^ to derive the average limit were based on the 
Yz proxy which severely under-estimates the true masses ( [von der[ 
[Linden et al.|2014|CoMaLit-II^ . Furthermore, the subsample with 
WL masses under-represents the clusters with low SNR just above 
the threshold ( [CoMaLit-II^ . The larger mean SNR of the subsample 
determines a larger average limit, as we observed. 


The flatness of the completeness limits of the optical richness- 
WL suhsample (see upper panel of Fig.|^ is also connected to the 
properties of the parent sample. The redMaPPer catalog is in fact 


nearly complete at z < 0.3 and A > 30 (|Rykoff et al. 


2014 i. 


5.6 Mass function 

The modelling with a redshift-evolving Gaussian function, see 
Eqs. ( [24[ [25[ l and \26\ , is functional as far as the mass distribu¬ 
tion is fairly unimodal and the redshift evolution is smooth. In the 
case of a cluster sample selected through a hard cut in the observ¬ 
able, these assumptions are well justified, see Sec. and App.[A] 
However, the scheme is flexible enough to work even with hetero¬ 
geneous sample since detections and measurements are generally 
limited by observational thresholds. 

As far as the distribution is unimodal, a simple parameterisa- 
tion of the distribution of the covariate variable in terms of a single 
Gaussian function can determine unbiased estimates of the scaling 
relation. Results are full consistent with more complex parameteri- 
sations adopting mixtures of Gaussians ( [Kelly[2007|CoMaLit-II[ l. 

The observed WL mass functions in different redshift bins are 
well reproduced by the regression model. Distributions for the ve¬ 
locity dispersion-, optical richness-. X-ray luminosity-, and SZ 
flux-WL samples are shown in Figs. and[T^ respectively. 

The regression model computes the distribution of the true masses. 
To compare them with the observed distribution of WL masses, 
we had to smooth the distribution firstly with a Gaussian function 
whose standard deviation is the intrinsic scatter between true and 
proxy mass, which is computed by the regression too, and then with 
a Gaussian whose dispersion is given by the observational uncer¬ 
tainty on the WL masses. We considered the median uncertainty in 
the redshift bin. 


5.7 Predictions for the CLASH sample 

We can now make use of the constraints on the investigated scaling 
relations to obtain robust predictions on the observed properties of 
the galaxy clusters that are part of the CLASH program (Cluster 
Lensing And Supernova survey with Hubble, [Postman et al.[2012^ , 
which provides one of the samples best studied at different wave¬ 
lengths. 

The scaling relation av-M 2 oo can be used to predict the re- 
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Table 4. Scaling relations as a function of the WL mass, Ez O = 10“ ■ Listed pai'ameters refer to logarithmic variables. For the conditional 

scaling, the adopted form is Yx = <^y\x + ^Y\x^ Iz log ^z - Conventions and units are as in Sec. I^and TablesF^andj^ 



Conditional scaling 


Intrinsic scatter 

Symmetric scaling 

Relation 


Py\x 

Iz 

O'V|X,0 

7o--k|x 

OLY-X 

Py-x 

(Tv-MwL.200 

2.73 ± 0.05 

0.22 ± 0.05 

■0.07 ± 0.23 

0.07 ±0.01 

0.24 ±0.84 

2.71 ±0.06 

0.25 ± 0.06 

A-Mwl_200 

1.50 ±0.06 

0.45 ± 0.05 

0.41 ±0.51 

0.24 ± 0.04 

-0.59 ±0.88 

L33±0.10 

0.65 ±0.12 

Lx-MwL,500 

0.13±0.15 

1.29±0.14 

■2.00 ± 0.63 

0.20 ± 0.04 

0.23 ±0.80 

-0.16 ±0.21 

1.65 ±0.24 

TsZ-Mwl,500 

-0.25 ± 0.03 

L00±0.11 

■0.05 ± 0.54 

0.16 ±0.03 

-0.31 ±0.95 

-0.30 ±0.04 

1.41 ±0.21 


Table 5. Predicted line-of-sight velocity dispersions of the CLASH clusters. WL masses (col. 3) and redshifts (col. 2) are from the hC^-all. The quoted 
uncertainty on (Tv is statistical (col. 4), including the contribution from the uncertainty in the mass estimates col. 5), from the uncertainties in the 

scaling relation parameters ((Sct®^, col. 6), and from the intrinsic scatter in the ctv-Mwl scaling relation and the related uncertainty col. 7). 


Name 

Z 

■MwL,200 

[km/s] 

[km / s] 

[km / s] 

^^scatter 

[km/s] 

ABELL 383 

0.187 

8.1 ±2.2 

958 ± 164 

63 

25 

153 

ABELL 209 

0.206 

17.6 ±3.0 

1145 ±193 

64 

48 

187 

ABELL 2261 

0.224 

21.2±4.1 

1204 ±206 

79 

60 

200 

RXJ2129.3-H0005 

0.234 

5.5 ± 1.4 

902 ± 157 

61 

35 

147 

ABELL 611 

0.288 

14.1 ±3.9 

1139 ± 197 

77 

34 

183 

MS 2137.3-2353 

0.313 

12.4 ±4.8 

1123 ±206 

100 

29 

180 

RXC J2248.7-4431 

0.348 

20.2 ±6.7 

1274 ±231 

108 

55 

210 

MACS J1931.8-2635 

0.352 

14.7 ±6.4 

1192 ±225 

119 

37 

192 

MACS J1115.8-H0129 

0.352 

15.5 ±3.4 

1205 ±204 

70 

39 

196 

RXJ1532.9-H3021 

0.363 

7.1 ± 1.9 

1022 ± 177 

70 

35 

167 

MACS J1720.3-H3536 

0.391 

13.4±3.1 

1195 ±204 

70 

36 

192 

MACS J0416.1-2403 

0.396 

10.3 ±2.2 

1130± 191 

62 

32 

182 

MACS J0429.6-0253 

0.399 

9.4 ±3.0 

1109 ± 197 

85 

33 

180 

MACS J 1206.2-0847 

0.44 

15.8±3.6 

1272 ±219 

78 

46 

208 

MACS J0329.6-0211 

0.45 

9.9 ± 1.5 

1156±193 

55 

39 

189 

RXJ1347.5-1145 

0.451 

29.3 ±6.1 

1465 ±260 

110 

88 

250 

MACS J1149.5-H2223 

0.544 

25.2 ±5.2 

1495 ±265 

108 

85 

257 

MACS J0717.5-H3745 

0.548 

30.5 ±4.9 

1561 ±278 

114 

100 

272 

MACS J0647.7-H7015 

0.584 

13.1±4.2 

1326 ±245 

113 

63 

224 

MACS J0744.9-H3927 

0.686 

17.3 ±4.7 

1497 ±278 

127 

90 

261 


suits for the ongoing measurements of the velocity dispersions ac¬ 
quired as part of the extension with optical spectroscopic data of the 
objects in the CLASH sample (e.g., the VLT-VIMOS Large Pro¬ 
gram of 230 hours to carry out a panoramic spectroscopic survey 
of the 14 southern CLASH clusters). The CLASH-VLT program 
aims at obtaining redshift measurements for 400-600 cluster mem¬ 
bers and 10-20 lensed multiple images in each cluster field ( |Biviano| 
|et al.|20r3) . Additional observations from northern facilities, i.e., 
the MMT/Hectospec ( |Rines et al.|2013^ , will complement the pro¬ 
gram. 

For these predictions, we performed a new regression excis¬ 
ing from the sample the eight CLASH clusters previously included. 
Since we are interested in predicting the velocity dispersion given 
the weak leasing mass, we considered the conditional re¬ 

lation rather than the true mass-velocity dispersion (TV-M 200 . The 
results of the regression were in full agreement with the regression 
of the full sample: aY\x = 2.75 ± 0.05; I3y\x = 0.22 ± 0.05; 
7 ;^ = -0.02±0.24; aY\x,o = 0.07±0.01; = 0.07±0.91. 

Alternatively, we might have used the (JV-M 200 scaling considering 
the additional source of error given by the intrinsic scatter between 
the known weak leasing masses and the unknown true masses. As 
a general remark, if we have weak leasing masses we cannot make 


predictions by plugging them in scaling relations which compare 
for example the observable to the hydrostatic mass. 


We considered for the CLASH clusters the weak leasing 


Umetsu 


masses reported in hC'‘-all (CoMaLit-III 1 and based on 
|et al.| ( |2014| l, who performed a combined analysis of shear and 
magnification. The velocity dispersions based on the crv-MwL ,200 
relation are listed in Tabled The main source of statistical uncer¬ 
tainty on the predictions is due to the intrinsic scatter of the relation, 
which abundantly tops the uncertainties due to the propagated error 
on the WL mass or due to the uncertainties in the scaling relation 
parameters. 


The prediction for MACS J 1206.2-0847 compares well with 
the first measurement from CLASH-VLT jBiviano et al.|[20T3| 
CTy = 1087lg| km/s). The HeCS covered two clusters later on 
analysed in [Umetsu et al.|(2014[ >, i.e., A2261 and RXJ2129. The 
predict ion for RXJ2129 i s in excellent agreement with the measure¬ 
ments ( Riiies'etaLpOlS CTv = 858lg7 km/s). On the other hand, 
the pre diction slightlyCTc eeds the observed velocity dispersion of 
A2261 I Rines et al.|2013 ctv = 780lgQ km/s), even though the 
small discrepancy is fully covered by the uncertainty due to the in¬ 
trinsic dispersion of the scaling. 
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Figure 5. Top panel: completeness function of the clusters from the WL- 
velocity dispersion sample. Weak leasing masses Mvvl 200 shown as 
a function of redshift (black points). The full lines plot the value of the true 
mass M 200 where a given completeness level is reached as a function of the 
redshift. From top to bottom, the red, green, and blue lines show the 85, 50, 
and 15 per cent completeness levels, respectively. The shaded green region 
encloses the 68 per cent confidence region around the 50 per cent complete¬ 
ness level due to uncertainties on the mass function parameters. Note that 
the completeness is a function of the true mass, whereas the points refer to 
the weak leasing masses, which are scattered with respect to the true mass. 
Bottom panel: scaling between velocity dispersion and mass, M200- The 
black points mark the data (weak lensing mass and redshift evolved veloc¬ 
ity dispersion), the blue (green) line represent the conditional (symmetric) 
scaling relation fitted to the data (true mass versus redshift evolved velocity 
dispersion). The dashed blue lines show the median scaling relation (full 
blue line) plus or minus the intrinsic scatter. The shaded blue region en¬ 
closes the 68 per cent confidence region around the median relation due to 
uncertainties on the scaling parameters. The red line represents the theoret¬ 
ical prediction based on |Munari et al.|f2013) at the median redshift of the 
sample. Masses are in units of 10 ^'^ATq. 


6 COMPARISONS 

In this section, we compare our results to theoretical predictions or 
previous estimates. 


6.1 Theoretical predictions 

We first compare our results to predictions based either on theoreti¬ 
cal models of structure formation or on numerical/hydro-dynamical 
simulations. 


Figure 6. Top panel: completeness function of the clusters from the WL- 
optical richness sample. Lines and points are as in the top panel of Fig.|^ 
Bottom panel: scaling between optical richness and mass, M2oo- Black, 
blue, and green graphics are as in the bottom panel of Fig. 

6.7.7 L^-ATa 

IStanek et al.| ( |201()) l presented a computational study of the intrinsic 
covariance of cluster observables using the Millennium Gas Sim¬ 
ulations. Two different physical treatments were proposed: shock 
heating driven by gravity only, or a second treatment with cool¬ 
ing and preheating. The predictions in |Stanek et al.| ( [2010| > on the 
scaling between mass and bolometric X-ray luminosity are strongly 
dependent on the adopted scheme. Acceptable values of the slope 
are in the range 1.1 < /? < 1.9, consistent with our estimate of 
I3y-z = 1.8 ± 0.3. 

|Ettori et al.| ( |2004^ noted a significant negative evolution in 
the Lx-A7a relation ( 7 ^ ~ —0.94) in a sample of local and high- 
redshift galaxy clusters extracted from a large cosmological hy- 
drodynamical simulation with cooling and preheating, supporting 
the first evidence of a negative evolution (with respect to the self¬ 
similar model) observed in Chandra data of high -2 clusters in |Et-| 
|tori et al^ ( |2004b| >. This negative evolution with 2 is in agreement 
with our findings (72 = —1.7 ± 0.8), even though the large sta¬ 
tistical error make the estimate marginally compatible with self¬ 
similarity. We noted negative evolution in two different samples 
of X-ray clusters, i.e, the sample from |Maughan et ^ ( |20 1 2) and 
the MCXC, whose luminosities were largely based on independent 
data-sets and procedures. 

A exhaustive study of the joint effect of feedback from super¬ 
novae (SNe) and active galactic nuclei (AGNs) on the evolution of 
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Figure 7. Top panel: completeness function of the clusters from the WL- 
X-ray luminosity sample. Lines and points are as in the top panel of Fig. 
Bottom panel: scaling between (self-similarly redshift evolved) X-ray holo- 
metric luminosity and mass M 500 . Black, blue, and green graphics are as 
in the bottom panel of Fig.|^ 


the X-ray scaling laws presented in [Short et al.| ( |20l0) l predicts an 
opposite behaviour of the evolution of the Lx-Ma relation. They 
found that the energy output from SNe and AGNs as implemented 
through semi-analytic models of galaxy formation causes a positive 
evolution. On the other hand, simulations based on a pre-heating 
model where an entropy floor of 200 keV cm^ is introduced at 
2 = 4 confirmed the presence of a negative evolution ([Short et al.| 

|MT^ . 

Positive evolution (i.e., higher luminosities at higher redshift, 
for a fixed mass, than the self-similar prediction) was also found 
in [Pike et al.| ( |2014| l in a series of radiative hydrodynamical mod¬ 
els, which suggested that radiative cooling is the main driver for 
departures from self-similarity. 

The pre-heating scenario seems to be preferred from our re¬ 
sults. 


6.1.2 Ysz-M^ 

The consensus from numerical simulations is that the Ysz-Ma is 
approximately self-similar in mass ( 1.6 < /3 < 1 . 8 ) and it is char¬ 
acterised by a small intrinsic scatter 1 Stanek et al.|20 10[ |Kay et ^ 
|2012[ [Battaglia et al.|2012| l. |Stanek et al. ( [2010| ( found that the evo¬ 
lution with redshift might be negative ( 7 ^ ~ —0.34) in presence of 
cooling and preheating. 


Figure 8. Top panel: completeness function of the clusters from the WL-SZ 
sample. Lines and points are as in the top panel of Fig.[^ The black line 
represents the 50 per cent completeness limit of the full SZ sample as de¬ 
rived in |Planck Collaboration et ^j2014^ from the average noise over the 
sky for the MMF3 algorithm. Bottom panel: scaling between (self-similarly 
redshift evolved) spherical SZ flux and mass, M 500 . Black, blue, and green 
graphics are as in the bottom panel of Fig. [^ The red line represents the 
result from |Planck Collaboration et al.|j2014b) at the median redshift of the 
sample. 


In agreement with simulations, we did not detect any departure 
from the self-similar scaling, with fiy-z = 1-50 ± 0.21. The uncer¬ 
tainty in the observed time-evolution is too large ( 7 ^ = — 0 . 8 ± 0 . 8 ) 
to infer any statistically significant deviation from self-similarity. 
In fact, the evolution with mass is fully consistent with the findings 
of |CoMaLit-II[ where we found fly-z = 1-37 ± 0.15 assuming a 
self-similar redshift evolution. 


6.1.3 (Jv-Ma 

Numerical simulations confirmed that the a^-MA relation is con¬ 
sistent with the self-similar scaling with mass ([ Evrard et al.|2008[ 
[Munari et al.|2013l|Saro et al.poisy Some differences may arise 
from the galaxy population used to estimate the velocity disper¬ 
sion and from the impact of selection using galaxy colour, projected 
separation from the cluster centre, galaxy luminosity, and spectro¬ 
scopic redshift ( [Saro et al.|2013^ . Whereas dark-matter particles in 
simulations trace a relation that is fully consistent with the theo¬ 
retical expectations, sub-haloes and galaxies trace slightly steeper 
relations with /? just above 1/3, and with slightly larger values of 
the normalisation jMunari et al.|2013) . This is due to dynamical 
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Figure 9. Mass function of the clusters from the ctv-Mwl sample in four 
redshift bins. The black histogram groups the observed WL masses. The 
blue line is the normal approximation estimated from the regression at the 
median redshift. The shaded blue region encloses the 68 per cent confidence 
region around the median relation due to uncertainties on the parameters 
of the mass function. The mass function for the observed WL masses is 
estimated from the fit result, i.e., the mass function of the true masses, by 
smoothing the prediction with a Gaussian whose variance is given by the 
quadratic sum of the intrinsic scatter of the (logarithmic) WL mass with 
respect to the true mass and the median observational uncertainty on the WL 
mass. Redshift increases from the top to the bottom. The median redshift 
and the redshift bins are indicated in the legends of the respective panels. 


Figure 10 . Mass function of the clusters from the optical richness-WL sam¬ 
ple in four redshift bins. Lines and conventions are as in Fig. [9] 

processes, namely dynamical friction and tidal disruption, which 
act on substructures and galaxies, but not on dark matter particles. 
The relevance of these effects depends on the halo mass and the 
effectiveness of baryon cooling, and may create a non-trivial de¬ 
pendence of the scaling relation on the tracer, the halo mass, and its 
redshift < |Munari et aL|2Q13t . A better statistical accuracy than that 
achieved with our results is needed to detect such effects. 

|Saro et aT| p013| t noted a substantial agreement between the 
time evolution of the (Jv-Ma relation and the expected self-similar 
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Figure 11. Mass function of the clusters from the X-ray luminosity-WL Figure 12. Mass function of the clusters from the SZ-WL sample in four 

sample in four redshift bins. Lines and conventions are as in Fig.|^ redshift bins. Lines and conventions are as in Fig.|^ 


evolution, 72 ~ 0, which we confirm here within the statistical 
uncertainties. 

The main sources of bias and scatter in velocity dispersion 
at fixed mass are the halo triaxiality, sampling noise, the pres¬ 
ence of multiple kinematic populations within the cluster, and the 
effect of interlopers ( |Saro et ^|2013[ ). |Saro et ah] j2013^ found 
o'iog(o-v/MA) ~ 0.05 locally, and that the intrinsic scatter in¬ 
creases with redshift, with velocity dispersions that are ~25 per 


cent less accurate for estimating single cluster masses at 2 = 1 
than at low redshift. [Stanek et al.lpOlOl l found a smaller scatter of 
'^iog(tT,,/MA) ~ 0.02 for dark matter particles. 

Our findings, i.e.,(Tiog(o-,,/MA) ~ 0.06±0.02, slightly exceed 
the theoretical predictions. The accuracy of our results is not good 
enough to appreciate any evolution of scatter with redshift. 
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6.2 Previous estimates 

We now discuss our findings in relation to previous results. We lim¬ 
ited the comparison mainly to scaling relations which employed di¬ 
rect mass measurements based on weak lensing or the assumption 
of hydrostatic equilibrium, whereas we mostly discarded works 
based on mass estimates based on external calibration. Previous 
analyses of the CTv-A/a were mostly based on mass measurements 
assuming the dynamical equilibrium and the virial theorem ( |Girardi| 
|& Mezzetti|2001[|Rines et al.|2013^ . This mass measurement is di¬ 
rect too, but it is strongly correlated with the velocity dispersion, 
differently from the weak lensing masses we considered here. 

We did not consider previous analyses of the conditional scal¬ 
ing relations in which the mass worked as the response variable. 
These relations are not easily inverted and cannot be compared 
straight on with our results. 


6 . 2.1 Z/x-fffA 


Observed slopes of the Lx-Ma relation from previous studies may 
be steeper than the self-similar prediction. Estimated values of /3 
ranges from 1.3 to 1.9 jVikhlinin et aL |2009 Pratt et aL]|2009[ 


|Amaud et al.||201^ [Reichert et al.|20TT [Ettori |2013[ ). S^rce of 

disagreement may be various. The slope and normalisation of the 
relation depend on the energy band and method used for the flux 
extraction ( [Ettori|2015[ l. 

The intrinsic scatter of the Lx-Ma relation is ~40 per cent 
( [Giodini et al.[201^ . It is the largest among the various X-ray scal¬ 
ing relations. We confirmed the large scatter in the Lx-Ma rela¬ 
tion. The X-ray luminosity is heavily affected by non-gravitational 
processes, the presence of cool-cores, and the overall dynamical 
state of the halo ( [Giodini et al.|2013l l. Most of the scatter derives 
from the inner regions where cooling and merging effects are most 
pronounced. Our estimate of the scatter based on the the soft band 
luminosities provided by the MCXC is fully consistent with most 
of the previous results (~ 40 per cent), whereas the estimate based 
on the core-excised bolometric luminosities from [Maughan et"^ 
( [2012^ is significantly smaller (~ 20 per cent). This suggests that 
a careful choice of the energy band and of the methods for the flux 
measurement might significantly reduce the intrinsic scatter of the 
Lx-Ma relation. 

A negative time-evolution of the relation was firstly noticed in 
[Ettori et P004b} and later confirmed by [Reichert et al.[ ( f2011[ (, 
which found 72 = —1.3 ± 0.2 applying a tentative selection-bias 
correction. Our results confirm these previous findings. 

From a multivariate analysis aimed to study X-ray luminos¬ 
ity, temperature, and gas mass fraction in a sample of clusters with 
well measured WL masses in the context of cosmological parame¬ 
ter determinations with cluster abundances, [Mantz et al.[j2014^ es¬ 
timated a slope of 1.71 ± 0.17 and an intrinsic scatter of 42 ± 5 per 
cent for the Lx^^^^-Ma relation assuming self-similar time evolu¬ 
tion. Apart from the assumed redshift dependence, these results are 
directly comparable to ours, since [Mantz et al.|j2014) considered 
the scatter of the WL mass and the effects of the selection func¬ 
tion. The estimated slope is in very good agreement with our result 
Py\z = 1.60±0.27. In principle, slopes obtained from a multivari¬ 
ate analysis may differ from the results of a single O-Ma relation 
if the considered observables are strongly correlated ( [Ettori|2013[ (. 
However, this is not the case of the X-ray properties considered in 
[Mantz etd^ \20\A) . 


Rozo et ^([2014^ developed a self-consistent method to de¬ 


rive scaling relations satisfying optical data from SDSS, X-ray data 


from ROSAT and Chandra, and SZ data from Planck. Assum¬ 
ing a self-similar time-evolution, they derived a slope of I3y\x = 
1.55±0.09 for the Lx^^^-Msoo relation with a scatter of ~ 39 ±3 
per cent. Slope and scatter from [Rozo et al.[ ( [2014a^ are consistent 
with our results for the Lx^^^-Ma relation based on the MCXC 
(see Sec. even though the analyses presents some major differ¬ 
ences. In fact, [Rozo et ^ used stacked data rather than 

measurements from single clusters and they assumed a self-similar 
time evolution. 


6 .2.2 Ysz-Ma 


Observed slopes of the scaling relation between mass and SZ flux 
are discordant to some degree jCoMaLit-II[ (. The Planck team de¬ 
termined the Ysz-Msoo relation relying on masses estimates based 
on the Yx proxy and assuming a self-similar time-evolution ( [Planck [ 
[Collaboration et al.|2014b^ . Through the BCES-orthogonal regres¬ 
sion, they found Py-x = 1.79 ± 0.06 and an intrinsic orthogonal 
scatter ~ 15-20 per cent (Planck Collaboration et al.|2014b[ (. A pre¬ 
vious calibration based on 19 weak lensing clusters mainly from the 
LoCuSS sample (Local Cluster Substructure Survey, [Okabe et ^ 
[2010[ l gave Py-x = 1.7 ± 0.4 ( [Planck Collaboration et al.|2013[ l. 
However, weak lensing masses of the LoCuSS clusters are biased 
low due to contamination effects and systematics in shape mea¬ 
surements (Okabe et al.|2013) (. The underestimate might be mass 
dependent and affect the estimated slope. With a self-consistent 
method, Rozo et al. j2014ai found a slope of Py[x = 1-71 ± 0.08 
with a scatter of ~ 15 ± 2 per cent. These estimated slopes are 
steeper but consistent within the statistical uncertainty with our re¬ 
sult of = 1.5 ±0.2. 

[Andreon](2014( claimed that the evolution of the Ysz-M^oo 
relation is significantly inconsistent with the self-similar evolution. 
He found 72 = 1.8 ± 0.4. The disagreement with our result, which 
is fully consistent with the self-similar prediction, might be due 
to the arbitrary choice in [ Andreoit] ( [2014^ to use a pre-determined 
completeness function. An inappropriate modelling can bias the es¬ 
timates of the scaling relations as well as neglecting time-evolution 
at all. 

Furthermore, [Andreon] (2 014[ l neglected the intrinsic scatter 
in the mass estimate, and, following [Planck Collaboration et al.| 
( [2014b[ l, he employed a mass proxy based on Yx, which is strongly 
correlated to the SZ signal. These choices likely explain the dis¬ 
agreement with our result. 


6.2.3 X-Ma 


Most of the analyses correlating optical richness to mass were 
based on stacking techniques (Rozo et al.[[2014^ [Covone et al.[ 
2014[ and references therein). Here, we focus on not binned data. 


Wen, Han & Liu[(2009[ l considered a compilation of clusters whose 
masses had been estimated by X-ray or weak-lensing methods to 
infer a slope of Py-z ~ 1.17 ± 0.03. 

[Andreon & Congdon[(2014t studied the mass-richness scal¬ 
ing for a sub-sample of the CCCP clusters (The Canadian Cluster 
Comparison Project, [Hoekstra et al.|2012^ . They found that rich¬ 
ness scales almost linearly with the projected weak-lensing mass 
(Py\x = 1-3 ± 0.3), with a statistically insignificant evolution 
(72 = —0.7 ± 0.7). The evolution with mass measured in Andre( 

& Congdon (2014^ is steeper than a previous result by Andreon 
& Berge ( 2012| , which used spherical masses and projected rich¬ 
nesses to find pY\x = 0.46 ± 0.12. 
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The slower slope found in [Andreon & Berg6| ( |2012[ > might 
be due to their assumption that the masses, which were measured 
with the caustic method, were unscattered measurements of the true 
masses. However, masses based on this method are highly scat¬ 
tered I CoMaLit-II^. Negl ecting this effect makes relations flatter 
( [CoMaLit^ CoM^it-lI^. The very l arge scatter of ~ 58 ± 7 per 
cent measured in Andreon & Berge] \2012) is biased high for the 


same reason. 


7 CONCLUSIONS 

To assess the role of the evolution with time of the scaling relations 
between mass and observed properties it is crucial to avoid biases 
in the calibration. The evolution of cluster scaling relations is still 
debated. Small samples or biases due to the (unknown) selection 
function are two of the main problems. We developed a regression 
methodology that at the same time constrains the evolution and cal¬ 
ibrates the completeness function of the studied sample. 

The method is general and lets the data determine the time- 
dependent scatter of the mass proxy or the intrinsic scatter between 
the observable and the mass. Selection functions and their time- 
dependence are often not known a priori. In our approach, the selec¬ 
tion function can be determined in the context of the regression pro¬ 
cedure. The approach we implemented is Bayesian and can easily 
include any additional or a priori information on the completeness 
of the sample. This method is functional in the context of large pho¬ 
tometric surveys such as Euclid ( [Laureijs et al.|20rT| , where self¬ 
calibration of scaling relations is crucial to unbiased estimates of 
dark energy with study of cluster abundances ^Majum dar & Mohr| 
[2004l l 

We tested the method with large heterogeneous samples to cal¬ 
ibrate either optical properties, such as richness and velocity disper¬ 
sion, or observables connected to the intra-cluster medium, such as 
X-ray luminosity and SZ flux. Masses were estimated with weak 
lensing analyses and intrinsic scatter in the mass estimate was con¬ 
sidered. Weak lensing masses are reliable mass measurements up 
to high redshifts. 

To our knowledge and not considering mass estimates based 
on the viral theorem, this is one of the first studies to compare 
galaxy velocity dispersions to direct estimates of masses of clus¬ 
ters (weak lensing masses in our case). This is the approach usually 
followed in numerical simulations jEvrard et al.|2008]|Munari et al.| 
|2013[[S^o et al.|2013| l to built mass proxies based on the velocity 
dispersion without assuming dynamical equilibrium and without 
exploiting the properties of the infall patterns. 

We found that observables scale self-similarly with respect to 
the mass and that they evolve self-similarly with cosmic time. The 
only exception is the Lx-ATa relation, which seems to show a neg¬ 
ative evolution. A similar level of evolution can be obtained in hy- 
drodynamical simulations of the intracluster medium by including 
additional radiative cooling and uniform preheating at high redshift 
as a simple model of non-gravitational heating from astrophysical 
sources ( [Ettori et al.|2004^|Short et al.|2010l [Reichert et al.|201 1| >. 

The intrinsic scatter in the mass-velocity dispersion relation 
is notably small, which encourages the use of velocity dispersions 
as mass proxies. Our procedure accounts for time evolution of the 
intrinsic scatter. However, the large statistical uncertainties made 
the parameters characterising this dependence as de facto noise pa¬ 
rameters which had to be marginalised over to avoid to bias low 
the uncertainties on the scaling relation. The determination of this 
evolution is out of reach in present samples of nearly one hundred 


of clusters ( |Mantz et al.|20ld^ , but is should be feasible in future 
large surveys that will be able to collect homogeneous multi-band 
information of an unprecedented number of clusters. 
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Figure Al. Distribution of true masses M 200 of ^ sample of clusters at 
2 = 0.3 (grey histogram) whose observable proxy mass was larger than 
-^th ,200 = 0 .5x10^'^ M 0 //i(vertical blue line).The assumed log-normal 
scatter is (j = 0.25. The empty histogram represents the full distribution of 
the true masses of all clusters before the selection; the grey histogram repre¬ 
sent the distribution of selected clusters, i.e, clusters whose observed proxy 
mass is above the threshold (vertical blue line); the red line is the Gaussian 
approximation to the mass function.. The true masses were extracted from 
a cosmological halo mass function following|Tinker et al.H2Q0'8). 



Figure A2. Probability function of the log-mass /i, i.e., the logarithm of 
the over-density mass Ma- We assume that fi is intrinsically distributed as 
a local power-law and that clusters are selected if they have a proxy mass 
o > — 1. The value of the slope of the halo function /3i, and of the intrinsic 
scatter for the different cases are reported in the legend. 
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APPENDIX A: THE MASS EUNCTION 

The mass distribution of an observationally selected sample of clus¬ 
ters is usually limited at large masses by the steepness of the mass 
function, and, at smaller masses, it is limited by some observa¬ 
tional thresholds they have to surpass to be detected/selected. This 
two factors conjure to make the distribution of the masses approxi¬ 
mately log-normal ( [Lima & Hu|2005^ . 

As an example, we extracted a number of clusters from the 
cosmological halo mass function ( [Tinker et al.|2008y The clusters 
were then selected if their observable proxy mass was in excess of 
a given threshold value. We assumed that the proxy masses were 
unbiased, i.e., the expected value of the proxy for a given mass 
is exactly the mass, but (log-normally) scattered with respect to 
the true masses. The final distribution, which is plotted in Fig. 
resembles a normal function. 

The mass function of an observed sample can be expressed as 


dn 


sample 




= x(m) 


dn 
d^ ’ 


(Al) 


where fi = log Ma and dn/dfi is the number density per logarith¬ 
mic interval. The completeness of the observed sample x can be 
usually approximated as 




(A2) 


where erfc is the complementary error function. 

Some considerations based on a toy-model can give us a better 
insight. The mass function in small mass and redshift ranges can be 
approximated using a first-order Taylor expansion as (jRozo et al.j 
|2014b||E^d et al.|2014) . 


gocexp(-/3rp). 


(A3) 


At large masses, the mass function is steep and the function in 
Eq. ||A3) provides a good approximation. Fitting the approximation 
to the mass function proposed by [Tinker et al.j ( [2008) , we found 
that at 2 ~ 0.3 (1.0), /3i ~ 6.8 (11.3) around a pivot mass of 
M 20 Q = 5 X IO^^Mq/Ii for the standard ACDM cosmology. 

Previous treatments have approximated the mass function of 
the sample with the cosmological halo distribution in Eq. (|A3) 


( Mandelba^&jeljak [2TO7 [Allen, Evrard & Mantz|20TT] [Rozo[ 

[et al.[[2014b[ [Evrard et al.[[2014[ and references therein). Here 
we want to consider the additional effect of the selection function, 
which severely limits the number of low mass halos. Let us assume 
that the observable O is log-normally scattered around the mass. 
The conditional probability for the log-variable o = log O is 


p(o|m) = -exp 

To simplify the notation, we assumed that O scales linearly with 
the mass and that is was normalised so that the expected value of o 
for a given mass is exactly fi. 

If we select a cluster sample imposing a hard cut in the observ¬ 
able, o > Oth, the resulting mass function of the selected clusters 
is 


.1 ( ^ 
2 V 


(A4) 


p{^i) = yerfc 


Oth — M 


y2c 


o|m 


exp 



— Oth -f 



(A5) 


By comparison with Eq. jAl) , we see that in this case the complete¬ 
ness function is exactly given by the complementary error function 
with = Oth and = o'oj^. 

The probability distribution of the observable is 

p{o) = Pi exp [-^1 (o - Oth)], (A6) 

for o > Oth, and p(o) = 0 below the threshold. The two- 
dimensional distribution is 


p{p.,o) = Pi 


exp 


■\/27rf7o|^ 

X exp 


.1 ( 

2 V ^o\i. 


n I I 2 

— Pi ( p — Oth -b 


(A7) 


for o > Oth, and it is null otherwise. 

The mass function of the selected clusters in Eq. ( [Al) , and 
its approximation in Eq. can be adequately described by a 
Gaussian distribution. The effectiveness of the Gaussian structural 
model for estimating the regression parameters was illustrated by 
[Kelly[ ( [?007) , who showed that as far as the distribution of the co¬ 
variate variable, i.e., the mass function in our case, is fairly uni- 
modal a simple Gaussian can perform competitively with a mix¬ 
ture. Eor a large range of scatters, 0.05 < < 0.25, and slopes, 

5 S /3i ^ 20, we found the following analytical approximation: 


pM 



exp 


1 f p-p y 
2 Wm / 


with 


(A8) 


p cri — l.2iPia^, (A9) 

and 


~ 0.062 - 0.0023ft + (1 - 0.0052ft)a^, (AlO) 

Equations ( [A9) and ( jATOl l can be used to approximately estimate 
the completeness function, Eq. jA2) , if we know the mass distribu¬ 
tion of the clusters, Eq. jA8) . The larger the intrinsic scatter and 
the steeper the mass function, the better the Gaussian approxima¬ 
tion, see Fig. |A2| Even for small scatters ((Jo|^ ~ 0.15) and shallow 
mass functions (Pi ~ 5), the normal approximation is reliable. 


APPENDIX B: CATALOGS OE VELOCITY DISPERSION 

We compiled two catalogs of galaxy clusters with measured veloc¬ 
ity dispersion, the Sigma Catalogs (SCs). SC-all comprises the full 
body of information. Multiple entries are present. The SC-single 
is a subsample with unique entries. When a cluster had multiple 
analyses available in literature, we picked for the SC-single the re¬ 
sults based on the larger number of identified member galaxies with 
confirmed spectroscopic redshift, Wmembers. 

In each catalog, objects are ordered by right ascension. The 
format of the catalogs is as follows. 

Cols. 1-2: name of cluster as designated in the original paper. 

Cols. 3-4: right ascension RA (J2000) and declination DEC 
(J2000), as quoted in the original paper. If coordinates are not 
quoted in the source paper or in a companion one, I reported the 
coordinates of the NED’s association. 

Col. 5: redshift z, as reported in the original paper. 

Col. 6: external validation through NED. ‘N’: the NED’s object 
was associated by name; ‘P’: the NED’s object was associated by 
positional matching; ‘NA’: no found association. 
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Cols. 7-11: as in cols. 1-5, but for the NED’s association. 

Col. 12: author code. 

Col. 13: bibliographic code from NASA’s Astrophysics Data 
System (ADS). 

Col. 14: Amembers, number of confirmed member galaxies used 
to measure the velocity dispersion. If the number was not available, 
we put the entry to —99. 

Col. 15: aperture radius within which member galaxies were 
looked for, in units of Mpc/fi. The radius is measured in a flat 
ACDM model with Dm = 0.3. If the information is not available, 
we put the entry to —99. 

Col. 16: line-of-sight velocity dispersion Cv, in units of km/s. 

Col. 17: uncertainty on the line-of-sight velocity dispersion, in 
units of km/s, as quoted in the reference paper, 5^^ . 

Col. 18: Standardised uncertainty in the velocity dispersion 
in units of km/s, measured as 


0.92crv 


S/Kn 




(Bl) 


or fixed to -99 if Amembers IS uuknown. 


The format of columns 1-13 follows the LC^ iCoMaLit- 


The catalogs are publicly available at http: / /pico . bo . 
astro . it/-sereno/CoMaLit/sigma/ and they will be pe¬ 
riodically updated. 


© 0000 RAS, MNRAS 000, 000-000 






